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Abstract 

A family of probability distributions (i.e. a statistical model) is said to 
be sufficient for another, if there exists a transition matrix transforming 
the probability distributions in the former to the probability distributions 
in the latter. The Blackwell-Sherman-Stein (BSS) theorem provides nec- 
essary and sufficient conditions for one statistical model to be sufficient 
for another, by comparing their information values in statistical decision 
problems. In this paper we extend the BSS theorem to quantum statistical 
decision theory, where statistical models are replaced by families of den- 
sity matrices defined on finite-dimensional Hilbert spaces, and transition 
matrices are replaced by completely positive, trace-preserving maps (i.e. 
coarse- grainings). The framework we propose is suitable for unifying re- 
sults that previously were independent, like the BSS theorem for classical 
statistical models and its analogue for pairs of bipartite quantum states, 
recently proved by Shmaya. An important role in this paper is played 
by statistical morphisms, namely, affine maps whose definition general- 
izes that of coarse-grainings given by Petz and induces a corresponding 
criterion for statistical sufficiency that is weaker, and hence easier to be 
characterized, than Petz's. 

Keywords: comparison of experiments, Blackwell-Sherman-Stein theorem, statistical suffi- 
ciency, quantum statistical models, quantum information structures, statistical morphisms 

1 Introduction 

The task in which an experimenter tries to learn about the true value of an 
unknown parameter by observing a random variable whose distribution depends 
on such a value, is generally called a statistical estimation task or a statistical 
decision problem. The mathematical structure used to describe such a situation 
is called statistical model, i.e. a family of probability distributions (or, more 
generally measures) indexed by a parameter set, which represents the unknown 
parameter one wants to learn about in the estimation process. 

An important subject in classical statistics is the comparison of statisti- 
cal models in terms of their "information value" in statistical decision prob- 
lems [IJ|2li2Hls3ElEllBlEl[ID]- Within this area, one of the most important 
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results has been proved by Blackwell, Sherman, and Stein 31 [3J [5] : the theo- 
rem states equivalent conditions for one statistical model being more informative 
than another. More explicitly, the Blackwell-Sherman-Stein (from now on, BSS) 
theorem proves that one statistical model carries more information than another 
if and only if the former is sufficient for the latter, namely, if and only if there 
exists a transition matrix (a Markov kernel) mapping the probability distribu- 
tions (measures) in the former to the probability distributions (measures) in the 
latter. 

In quantum statistical decision theory [TTJ [T2] , where statistical models are 
replaced by families of non-commuting density operators (i.e. quantum statis- 
tical models), the notion of sufficiency has been introduced and developed by 
Petz [I31Q3], by replacing Markov kernels with completely positive (or, at least, 
two-positive) trace-preserving maps, i.e. coarse-grainings. However, the idea 
of applying to the quantum case concepts from the theory of comparison of 
statistical models a la BSS, like e.g. the concept of information value, has not 
been explicitly pursued until recently, in a work by Shmaya [T5]: there, partial 
ordering relations between pairs of bipartite quantum states (analogous, in a 
way, to the partial ordering relations used in the BSS theorem) are introduced, 
and an equivalence relation between such partial orderings is established. Sub- 
sequently, in [T2], Chefles reformulated Shmaya's result for the comparison of 
pairs of quantum channels. However, the equivalence relations proved in [T3] 
and [16] neither imply any criterion for the comparison of quantum statistical 
models, nor are they more general than the BSS theorem, with which they are, 
in fact, logically unrelated. This is due to the fact that both Shmaya and Chefles 
need, in their proofs, quantum entanglement: as such, their results are purely 
quantum and cannot be compared with the case of classical statistics, where 
quantum entanglement is not available. 

The aim of this paper is to bridge the gap mentioned above, by developing 
a general theory for the comparison of statistical models, which can be applied 
both to the classical and the quantum (i.e. non-commutative) setting. In order 
to do this, it is mathematically convenient to relax the definition of sufficiency 
introduced by Petz [THl E] and define a weaker notion of sufficiency, which we 
call m- sufficiency, based on the concept of statistical morphism^. Statistical 
morphisms arc affine maps satisfying the minimum requirements necessary to 
make them meaningful in a statistical sense: in fact, as we will carefully argue 
in what follows, even the requirement of positivity can be lifted, without com- 
promising the formalism. In spite of their generality, statistical morphisms are 
sufficiently well-behaved, so that, in some cases, they can be extended to com- 
pletely positive coarse-grainings. This fact is proved in two extension theorems, 
of crucial importance in this paper, analogous to those proved for positive maps 
by Choi (Theorem 6 in [TB]) and Arveson (Proposition 1.2.2 in [TT?]). 

The generality of the definition of statistical morphisms makes the main re- 
sult proved here applicable to both commutative and non-commutative scenar- 
ios. When specialized to the classical setting, our result provides an alternative 
proof of the BSS theorem, while, in the quantum setting, an equivalent charac- 
terization of Petz's sufficiency criterion is obtained. An intermediate, 'hybrid' 
quantum-classical case is also considered and completely characterized. We are 

lr The term "statistical morphism" has been introduced in the classical setting by Morse 
and Sacksteder 1171 . In this paper we use the same term, but in a non-commutative setting. 
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also able to recover Shmaya's result as a special case, although here, in contrast 
with Refs. |15j and [16] , we never need to resort to any additional entangled 
resource. 

The paper is organized as follows: in Section [5] we briefly review the notions 
of statistical models, statistical decision problems, and comparison of statistical 
models in classical statistics. In Section [3] we introduce some basic definitions, 
extending the idea of comparison of statistical models to finite dimensional quan- 
tum systems. In Section 31 we introduce the notions of statistical morphisms 
and m-sufficiency. In Section [5] we prove two extension theorems for statisti- 
cal morphisms. Section [5] contains the main result, which is then applied, in 
Section[?l in order to recover the BSS theorem, characterize a semi-classical sce- 
nario, and obtain an equivalent characterization of Petz's sufficiency relation. 
Section [5] deals with the scenario originally considered in Ref. [TS] and the result 
proved by Shmaya is recovered without the need of any entangled auxiliary re- 
source. Finally, Section concludes the paper with the summary of its contents 
and one remark about generalized probabilistic theories. 

2 Classical formulation 

A (finite) statistical model § is defined by a triple (0,A,a:), where O is a 
(finite) parameter set {0}eee, A is a (finite) sample set {S}seA, and a is a 
family (pg]9 £ 0) of probability distributions pg on A, i.e., po(8) 3* and 
^2s e /^Pe{S) = 1. In the following, it will sometimes be convenient to think of 

each pe as a |A|-dimensional probability vector pe = {p\ , ■ ■ ■ , Pg , ■ • ■ , Pg A '), 
whose components are defined as p\ := pg(5). 

Remark 1. In many relevant situations, A can be considered as the set of possi- 
ble states of a physical system, so that the probability distribution pg becomes 
the statistical description of the state of the system. This point of view, which 
is the guiding one in Ref. [TT], will be implicitly adopted here as well. 

A statistical decision problem is defined by a triple (<?,X,£), where § — 
(0, A, a) is a statistical model, X is a (finite) decision set {i}i^x, and £ : <d x 
X — > 1R is a payoff function. The decision problem works as follows: upon the 
observation (or state) 6 g A, occurring with probability pg(5), the statistician 
performs a decision, namely, he applies a X-decision function u : A — > X, gaining 
a payoff (or suffering a loss, if negative) of £(9, i), depending on the "true" law 
of nature 9 that determined the observed state 8. The choice of the function 
u : A — > X corresponds to the experimenter's choice of a strategy. 

The deterministic X-decision function u : A —> X is often generalized to a 
randomized "X-decision function (or X-r.d.f.) 0, which is a convex combination 
of X-decision functions, i.e., a function mapping each 8 G A to a probability 
distribution t$ on X. A convenient way to represent a X-r.d.f. <j) is by giving 
conditional probabilities t<f,(i\S) > 0, i.e. non-negative real numbers such that 

Siex^(*l^) = 1' f° r all <5 £ A. 

Given a decision problem X,£), for each X-r.d.f. (f>, we introduce the 
payoff vector v(4>; X, £) 6 R' e ' , whose 9-ih. component, representing the payoff 
gained if the true law of nature is 9, is defined as 

v 6 ^ <?, X, I) := i) E Umpe{5). (1) 
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Then, the following set 

e(<f, X, l) := {w(0; <?, X, I) \<j> is a X-r.d.f. on A } (2) 

forms a (closed and bounded) convex subset of R' e , since it inherits the convex 
structure from the set of randomized decision functions. 

Let now & = (G, A', fj) be another statistical model, with the same param- 
eter set G as for but with a different sample set A' and a different family of 
probability distributions on A', f$ = (q$; 9 6 O). Also for J^", we can define, for 
each decision set X and each payoff function I : 8 x X — > ]R, the convex set of 
achievable payoff vectors as 

X, l) := {v(<t> f ; X, t) |0' is a X-r.d.f. on A' } . (3) 

In classical statistics, the following partial ordering between statistical mod- 
els with the same parameter set G is introduced (see, e. g., Ref. [3]): 

Definition 1 (Information Ordering). The statistical model S = (0,A, a.) is 
said to be always more informative than & = (G,A',/3), in formula S D 
if and only if, for every finite set of decisions X and every payoff function i : 

exi^R, e(g,x,£)D e(&,x,£). 

In other words, S is said to be more informative than if every payoff 
vector attainable in the problem (J^", X, £) is also attainable in the problem 
{$, X, £). The definition of information ordering between statistical models can 
be simplified as follows. Given a statistical model S = (G,A, a), for every 
decision problem (S ', X, i) and every X-r.d.f. 0, we define 

s(*, X, e,4>) := -L £ Ofi, i) E UmPo(S). (4) 
The maximum of s(#, X, £, (f) over all X-r.d.f. <fi is correspondingly defined as 

$(<?, X,£) : = max f 7^ E E ^> J ) E 
••II eeeiex <5eA 

i (5) 

1 1 eee 

In the Bayesian approach, when there is no compelling reason to treat the 
sample set differently from the parameter set, it is reasonable to interpret the 
factor 1/|G| as an a priori probability distribution over the unknown parameter 
£ G. In this framework, the function $(#,X, £) is understood as the optimal 
expected payoff, and the following partial ordering between statistical models 
governed by the same parameter set G is introduced (see, e. g., Ref. [10 ): 

Definition 2 (Bayesian Information Ordering). The statistical model S — 
(G, A, a) is said to be bayesianly always more informative than & = (G, A', /3), 
in formula $ DBaycs & ■> if and only if, for every finite decision set X and every 
payoff function I : G x X -> JR, %(S , X, I) > $(#", X, £). 

In other words, <§ is said to be bayesianly more informative than & if every 
expected payoff attainable in the problem (&, X, f) is also attainable in the 
problem (<?, X,£). In Appendix [Al we report the proof of the following basic 
fact: 
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Proposition 1. $ D & if and only if S Deaycs & • 

Since the two orderings S D & and § DBaycs & are equivalent, from now 
on we will keep only the notation S D JP, stressing the fact that the "Bayesian 
information ordering" relation given in Definition [2] does not really depend on 
any strong Bayesian assumption. 

Another partial ordering between statistical models with the same parameter 
set is relevant, and it is defined as follows, according to [3]: 

Definition 3 (Sufficiency). The statistical model <§ = (0, A,a) is said to be 
sufficient for J? = (0, A', /3), in formula S >- J?, if and only if there exists a 
A' | x |A| transition matrix M, i.e. a matrix of non-negative numbers M$\s with 
J2s'eA' Mgij = 1 for all 6 £ A, for which q s = Mp e , for all € 0. 

The Blackwcll-Sherman-Stein (BSS) theorem states the following important 
equivalence relation: 

Theorem 1 (BSS Theorem [31 [HE]). Given two statistical models § = (0, A, a) 
and & — (0, A', j3), governed by the same parameter set 0, § >- & if and only 

3 The formulation in quantum theory 

In what follows, we only consider quantum systems defined on finite dimensional 
Hilbert spaces "K. We denote by H(JC) the set of all linear operators (identified 
with their representing matrices) acting on JC, and by S(3~C) the set of all density 
matrices (or states) p £ fi-(IK), with p > and Tr[p] = 1. The identity matrix 
will be denoted by the symbol 1, whereas the identity map will be denoted by 
id. 

Most of the concepts used here are introduced and rigorously formalized in 
Refs. [TT] and [T^]. For reader's clarity, however, we will report the definitions 
we need, in a simplified fashion. According with [TT] (see also Remark [JJ, we 
adopt the following definition: 

Definition 4 (Quantum Statistical Models). A quantum statistical model is 
defined by a triple R = (0, JC, p), where is a (finite) parameter set, "K is a 
(finite dimensional) Hilbert space, and p = (pe]0 £ 0) is a family of density 
matrices in S(J£). A quantum statistical model R is said to be abelian when 
[p 8 ,P8>] = 0, for all 0,0' G ©■ 

Definition 5 (POVM's). For any (finite) decision set X = {?}, a positive- 
operator-valued X-measure (X-POVM) P x on the Hilbert space !K is a family 
(P l ;i £ X) of operators P l £ t{%), such that P l > for alH £ X and £ l6X pl = 
1. From now on, the superscript X will be dropped when clear from the context. 

Definition 6 (Quantum Statistical Decision Problems). A quantum statistical 
decision problem is defined by a triple (R, X, £), where R = (0, !H, p) is a quan- 
tum statistical model, X is a (finite) decision set {i}i^x, and / : 8x1- >]Risa 
payoff function. The choice of a strategy for the problem (R, X,£) corresponds 
to the choice of a X-POVM P = (P l ; i £ X) on "K. The corresponding expected 
payoff is computed as 

s q (R,X,^,P) := £^(0,z)Tr[ Pe P% (6) 
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The maximum expected payoff for the decision problem (R, X, £) is defined as 
$ q (R,X,£) :=max-i-]r^(MTrb e P% (7) 

Notice the use of the subscript "q", for "quantum", to distinguish the expressions 
above from their classical analogues appearing in (U]) and ([5]). 

Given two quantum statistical models R = (0,3~C, p) and S — (0,!K',er), 
governed by the same parameter set 0, but with different Hilbert spaces 'K 
and W and different families of quantum states p = (pg G S(3-C);9 G 0) and 
er = (ag G S(!K'); 9 G 0), the following partial ordering is introduced: 

Definition 7 (Information Ordering). A quantum statistical model R = (0, !H, p) 
is said to be always more informative than S = (0, er), in formula R D S, if 
and only if, for every finite decision set X and every payoff function I : x X — >• 
]R, $ q (R,X,^) > $ q (S,X,^). 

In other words, R is said to be more informative than S if every expected pay- 
off attainable in the problem (S, X, £) is also attainable in the problem (R, X, £). 

Remark 2 . We stress once more that the information ordering relation between 
quantum statistical models introduced above does not depend on the a pri- 
ori distribution on used to compute the expected payoff ([7]). One could in 
fact adopt an information ordering for quantum statistical models analogous to 
that introduced in Definition [TJ and prove that the two ordering relations are 
equivalent. This is due to the fact that Proposition [T] is valid also for quantum 
statistical models. 

Remark 3 (Quantum-Classical Correspondence). Given an abelian quantum 
statistical model R = (0,!K, p), it is always possible to construct, from R, a 
(classical) statistical model £r = (0, Ak,Q p ) that is completely equivalent 
to R, in the sense that, for every finite decision set X, every payoff function 
I : x X -> R, and every X-POVM P on "K, there exists a X-r.d.f. on A M 
such that s((?r,X,^, 0) = s q (R,X,^, P). Such a correspondence is obtained by 
first introducing a sample set Ajf = {5} with |Ajf| = dim J{, so that any or- 
thonormal basis for Jf can be indexed by A^c. Then, since all density matrices 
pe are pairwise commuting, an orthonormal basis {\<ps) £ ^}sgA k for 'K exists, 
with respect to which all pg are simultaneously diagonal. Finally, the family of 
probability distributions a p = (pg;9 G 0) on Aj^ is defined according to the 
relation pg(5) :— {ips\pe\fs), f° r all 8 € A^ and 9 G 0. Then, it is easy to 
check that the statistical model <Sr = (0, Am, ct p ), obtained in this way from 
R = (0, p), is completely equivalent to the initial quantum statistical model 
R, in the sense explained above. This in particular implies that, for every finite 
decision set X and every payoff function £ : x X — > ]R, $(4, X, £) — $ q (R, X, £). 

Conversely, given a (classical) statistical model § — (0,A,a), it is al- 
ways possible to construct an equivalent abelian quantum statistical model 
Rg — (OjIKajPcJ, by introducing a Hilbert space JfA, with Aiva'K^ = |A|, 
and a family p a = (pg;9 G 0) of diagonal density matrices on JCaj de- 
fined by the relation pg = J28eAPe(^)\ l Ps){^s\, where {\ips) G M A }<5eA is 
any orthonormal basis for 5£a- Also in this case, it is easy to check that 
the quantum statistical model Rg = (0,!Ka, P a ), obtained in this way from 
£ = (0, A, a), is completely equivalent to the initially given statistical model 
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£ , in the sense that, for every finite decision set X, every payoff function 
f : 6 X I -> R, and every X-r.d.f. (j> on A, there exists a X-POVM P on K A 
such that s q (R^, X, £, P) = s(<?, X, £, 4>). In particular, $ q (R,?, X, i) — $(<?, X, £). 

These ideas can be compactly re-expressed as follows: 

Postulate 1 (Correspondence principle). Classical statistical models are iden- 
tified with abelian quantum statistical models, and viceversa. 

A quantum statistical model R involves a parameter set and a Hilbert 
space "K. In a sense, then, a quantum statistical model constitutes an asym- 
metric structure, where a quantum system carries information about a classical 
parameter. It is useful hence to provide a notion for a "fully quantum" informa- 
tion structure. This can be done as follows: given a finite parameter set O, let 
"Kq be a Hilbert space such that dim JCe = | €> | , i.e., such that that there exists 
a complete set of orthonormal vectors {\fe) £ Jfejeee, labeled by 9, which 
form a basis for "K®. For the sake of notation, let us denote \(fie) simply by 
\0). Then, each quantum model R = (6, 5f, p) defines a corresponding bipartite 
quantum state 

Pab--=T^Y,\°)^®Pb, (8) 
' ™ ' see 

where 'Ka — !Hs = !H, and p B = p$. The particular "classical-quantum" 
structure of the state given in (0) reflects the above mentioned "hybrid" struc- 
ture of a quantum statistical model. Instead, by allowing pab to be an arbitrary 
bipartite state, we arrive at the following definition: 

Definition 8 (Quantum Information Structures |15j). A quantum information 
structure Q AB is defined as a triple {'Ka, K b , Pab), where 3~Ca and "K B are finite 
dimensional Hilbert spaces, and pab £ ^{^A ® K B ). 

The notion of quantum information structure is hence the "fully quantized" 
analogue of a quantum statistical model. In the same way in which a quantum 
statistical model can be used to define a quantum statistical decision problem, 
a quantum information structure can be used to define a quantum game0 as 
follows: 

Definition 9 (Quantum Statistical Decision Games [15j). A quantum statistical 
decision game is defined as a triple {Qab> ^a)' wnere Qab = ffl-A, Kb, Pab) 
is a quantum information structure, X is a (finite) decision set {i} i& x, and 
is a family (0 A ;i <E X) of self-adjoint payoff operators O l A G SJ^Ka)- (From 
now on, the superscript X in will be dropped when clear from the context.) 
The choice of a strategy for player B corresponds to the choice of a POVM 
P b = {Pb'i * G X) on The corresponding expected payoff is computed as 

s q (g AB , X, A , Pb) := £ Tr [{0\ ® P%) Pab] ■ (9) 

The maximum expected payoff is given by 

$ q (g AB ,X,0 A ) :=max^Tr[(0\® P B ) p AB ] . (10) 

2 In the very specific sense given in Ref. [151 . 



7 



The following definition was introduced in [15j as a very natural analogue of 
Definition [2J 

Definition 10 (Information Ordering). Given two quantum information struc- 
tures Q AB = {'K a ,'Kb,Pab) and q AB , = fit A , 3i B > , <j AB , ) , g AB is said to be 
always more informative than c, AB ,, in formula, 

Qab^ac,ab>, (11) 

if and only if, for every finite decision set X and every family of self-adjoint 
payoff operators A — (O a ; i 6 1) on "Kai 

$q(?AB> Oa) > $q(c,AB'i X> Oa)< (12) 

Remark 4. In analogy with Remark [21 here we note that any quantum infor- 
mation structure Q AB = fit a, JCb, Pab), for which a decomposition like that in 
Eq. (JSJ exists, naturally induces a corresponding quantum statistical model R g = 
(0, JCbj (jO^ j $ £ where the states p e B are those appearing in |S]). Moreover, 
any quantum statistical decision game (p^, X, Oa) built upon such a classical- 
quantum structure Q AB is completely equivalent to a quantum statistical de- 
cision problem (R g ,X, Iq), in the sense that $ q (g AB ,'X,0 A ) = $ q (R g ,X,£o)j 
where the payoff function io is defined by 1(6, i) :— (8 a \O a \9 a ), with the vec- 
tors \8 a) being the same as in 

For the reader's convenience, we end this section by summarizing the con- 
tents of Remarks [3] and Q] as follows: 

1 . the most general notion is that of quantum statistical decision games over 
quantum information structures; 

2. quantum statistical decision problems over quantum statistical models are 
equivalent to quantum statistical decision games over hybrid classical- 
quantum information structures; 

3. classical statistical decision problems over statistical models are equivalent 
to quantum decision problems over abclian quantum statistical models. 

In other words, quantum information structures contain quantum statistical 
models (as hybrid structures), which, in turn, contain classical statistical mod- 
els (as abelian models). For this reason we will first formulate our results for 
quantum information structures: quantum statistical models and classical sta- 
tistical models will be considered afterwards, as particular cases. 

4 Sufficiency conditions for quantum informa- 
tion structures and statistical morphisms 

In the previous section we extended the notion of information ordering to quan- 
tum statistical models and quantum information structures, depending on their 
"information value" in quantum statistical decision problems and quantum sta- 
tistical decision games, respectively. In the following we will carefully define 
what it means that a quantum information structure is sufficient for another. 
In order to do this, we will need to consider a formalism which is slightly more 
general than the one we used before. 

We begin with the following definition: 
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Definition 11 (State Spaces). The state space S of a quantum system denned 
on a Hilbert space "K is a non-empty subset of ®(?t), containing all possible 
physical states of the system. 

Remark 5. Usually, the state space S coincides with the set S(JC) of all possible 
density matrices acting on !K. However, there are cases in which the states 
accessible to the system form a proper subset of G(IK), for example, when the 
system is known to obey a conservation law. For later convenience, we keep our 
definition of state space as general as possible. This is also the reason why, in 
the above definition, there is no assumption about the convexity of the state 
space, as we do not need such assumption in general (even though, in many 
physically relevant situations, that would seem rather natural). 

Definition 12 (Effects and Tests). An operator X G j£(J£) is called an effect on 
a state space S (defined on !K) if and only if there exists an operator P G £(5t), 
with < P < 1, such that Tr[Xp\ = Tc[Pp], for all p 6 S. 

For any (finite) decision set X = {i}, a family (M l ;i G X) of operators 
AP G JL(JC) is called a X-test 93i 3c on a state space S (defined on J£) if and only 
if there exists a X-POVM P x = {P' l ;i G X) on IK with Tr[Af 4 p] = Tr[P l p], 
for all i £ 1 and for all p G S. Any such POVM P x is said to realize the test 
SJi^ on S. From now on, the superscript X will be dropped when clear from the 
context. 

Remark 6. For a given state space S and a given decision set X, two families 
9Jt = (M t ;i G X) and s Jl = (N l ;i G X) of operators in €(?£) are statistically 
equivalent on S, in formula 9JI ~s 9i, if and only if Tr[A:P p] = Tr[A^ p], for 
all i G X and all p G S. For any family 931 = {AP;i G X), let [9% be the 
corresponding equivalence class induced by ~g. Any X-test on © can hence be 
thought of as the equivalence class of some X-POVM on Jt. 

Remark 7. A second, more physically motivated way to think of tests is the 
following: X-tests on a state space S are those affine mappings, from (3 to 
probability distributions on X, which can be physically realized as quantum 
measurements. This is guaranteed by requiring, in the definition of test, the 
existence of at least one POVM that is statistically equivalent to it: in fact, 
all physically realizable quantum measurements give rise to a POVM, and any 
POVM can be physically measured |21| . Such a restriction in the definition of 
tests is meaningful only if there exist cases of affine mappings from a state space 
S to probability distributions on a decision set X, which cannot be realized by 
any POVM. If the state space is the totality of states S(J{), then, all such 
affine mappings are in one-to-one correspondence with POVM's, and there is 
no need to introduce further definitions. However, in the general case in which 
S G S(J{), the distinction between tests and "unphysical" affine mappings 
become relevant, and Definition WI\ is necessary. 

We are now in the position to rigorously introduce the idea which will be 
the basis of our analysis: 

Definition 13 (Statistical Morphisms). Given two state spaces Sj n (defined on 
a Hilbert space IK; n ) and S out (defined on a Hilbert space IH out ), we say that a 
linear map L : £(Jtj n ) — > j2.(J£ out ) induces a statistical morphism L : S; n — > S out 
if and only if the following conditions arc both satisfied: 
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1. for every p £ S In , L(p) 6 S out ; 

2. the dual transformation L* : -t(J£ out ) — > 4(J£i n ), defined by trace dualitjH, 
maps tests on S ut into tests on Sj n . 

Remark 8. Notice that the notion of statistical morphism, introduced in Defi- 
nition [T21 is in principle strictly weaker than the notion of positive map, which 
is a linear map that transforms positive operators into positive operators. In 
fact, given a positive operator P < 1 on 3~C ut, the operator L*(P) might have 
negative eigenvalues, and yet be an effect on Si n , according to Definition [T21 On 
the contrary, a linear, trace- preserving, positive map from <t(J{i n ) to €(!H ou t) 
always constitutes a statistical morphism. An open question is whether any 
statistical morphism can always be extended to a positive map. Indications 
that this might not be true in general are provided in Ref. [22], Corollary 10. 
However, at the moment of writing, an explicit counterexample is not available. 

Definition 14. Given a quantum information structure Q AB = {K A , K b , Pab), 
the associated state space <B b (q ab ) — °f physical states of the subsys- 

tem B is defined as 



Tr A [(P A ®t B )p AB ] 



Tr[(P A ® t B )p 



AB 



Pa e 1{-K A ) : < P A < t A \. (13) 



Remark 9. The state spaces associated with a given information structures turn 
out to be convex state spaces. This can be easily verified by direct inspection. 

Remark 10. From now on, it is convenient to think that, in Eq. (|10|). the max- 
imum over POVM's Pg on "Kb is replaced by a maximum over tests DIb on 
<5 B (g AB ). Such a replacement, which is formally convenient, is quantitatively 
irrelevant, since it does not affect the value of the maximum expected payoff, 
nor it modifies the information ordering relation introduced in Definition 1101 

We are now able to rigorously define the notion of sufficiency (in a sense anal- 
ogous to the one used by Blackwell in [6]) for quantum information structures, 
in its two variants: sufficiency and m-sufficiency. 

Definition 15 (Sufficiency and m-sufficiency). Given two quantum information 
structures Q AB = (3< A , 'Kb , Pab ) and c, AB > = (JCa , 3tfl' > °ab' ) , we say that q AB 
is m-sufficient for C, AB i, in formula 

Qab^Sab', ( 14 ) 

if and only if there exists a statistical morphism L B : & B (g AB ) — > (B B >(c, AB ,) 
such that 

oab' = (id.4 ® £b)(pab)- (15) 
We say that Q AB is sufficient for c, AB , , in formula 

Qab^Sab>, (16) 

if and only if there exists a completely positive, trace-preserving map £ B : 
1(3< B ) -> 4,(5C B ') such that 

OAs- = (idA ® £b)(pab)- (17) 



3 For any operator X G £(M ou t), L*(X) £ 4(W in ) is defined by the relation Tr[£*(X) Y] 
Tr[X -C(Y)], for every Y g €(M in ). 
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Intuitively speaking, the idea of sufficiency is related with the fact that 
the transformation can be actually performed physically, as an open evolution. 
On the contrary, the notion of m-sufficiency introduced here just assumes the 
existence of a formal statistical procedure to map one strategy into another. 

5 Extension theorems for statistical morphisms 

Even if the notion of statistical morphism is weaker than that of positive map, 
two famous extension theorems for positive maps, proved by Choi |18j and 
Arveson [TO], can be generalized to statistical morphisms as well. 

Definition 16 (Complete State Spaces). A state space S on 5f is called com- 
plete for £(JC) if and only if it contains (dim "K) 2 linearly independent density 
matrices. 

Definition 17 (Composition of State Spaces). Given two state spaces S Q (on 
"Koi) and S^ (on !Hp), we define the set 

S Q x <5p := {cr Q <g> 773 \a a £ <S a , r^S^}. (18) 

An operator X £ H(3-C Q (g> !Hp) is an effect on S a x S^ if and only if there exists 
an operator P £ fL{"K a ® Jip), < P < l a <g> lp, such that Tr[A (cr Q ® rp)] = 
Tr[P (o~ a ® T]g)], for all cr Q £ S Q and rg £ S^. In the same way we extend the 
notion of tests. Notice that effects or tests on S a x S^ need not be factorized. 

Proposition 2. Given two state spaces Si n and S out , defined on !Ki n and !K ou t, 
respectively, and a third auxiliary complete state space So, defined on !Ko — 
Jfout, suppose that the linear map id ® £ : Jl(5{o) ® fi.(Jf; n ) — > £(J£o) ® -S-(^out) 
induces a statistical morphism from So x Sj n £0 So x S out . Then, there exists 
a completely positive, trace-preserving map £ : £.(Jfj n ) — > £(J£ out ) such that 

£(*)=£(*), (19) 

for all a £ Sj n . 

Proof. Let (i? 4 )^, where e? = dimJfo = dimM ou t, be the POVM consisting of 
the d 2 generalized Bell projectors acting on !Ho <8» Jt ut- By trace-duality: 

Tr [P/(w®£(cr))] = Tr [(id® (w ® cr)] , (20) 

for all a £ Si n and all w £ So. The fact that id ® £ is a statistical morphism 
implies, by definition, that the operators ((id®£*)(-B l ))f =1 , even if not positive, 
yet induce a test on So x Si n . In other words, there exists a POVM (B l )f =1 on 
3~Co ® 3~Cin such that 

Tr [(id ® £*)(£*) (w ® it)] = Tr [JET (w ® ct)] , (21) 

for all a £ S; n , all w £ So, and every i. Due to the assumption that So is 
complete, there always exist d 2 states in So which form an operator basis for 
£(JCo). We can then extend Eq. (|2ip by linearity and obtain that, in fact, 

Tr [B l {X ® L(a))] = Tr [B l (X ® tr)] , (22) 
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for all a € G; n , all X 6 H(!Ko), and every i. 

Using the POVM (B t )f =1 (whose existence we proved above), we now con- 
sider the identity (via teleportation): 



a 

= Tr^ 7 [(K ® 1^) (1 Q ® fl£ 7 ) (*+, ® £> 7 )) ((t£)t ® V) 



= ^ Tr^ 7 [(E/j ® 1^) (1 Q B%) (v+p ® cr 7 ) ((t£)t ® 1^) 



(23) 



where \I f+ = d 1 ^ . =1 ® is a maximally entangled state on !K® 2 

and (U l )f =1 is an appropriate family of unitary matrices on 'Kq. The relation 
above holds for all a £ Si„ C S(JCi n ). However, it is clear that the last term in 
Eq. (22J) can be extended, by linearity, to a completely positive trace-preserving 
map £ : 4(5{ in ) -> fc(J£o) = £(5C out ) defined as: 



£(p) 



: ]T Tr^ 7 [([/* ® 1^) (l a ® B* 7 ) ® p 7 ) ((I7j)t ® 1^) 



i=l 



(24) 



for all p € S(Jfi n ). This hence concludes the proof that a completely positive 
trace-preserving map £ : £(5£ in ) — > £(J{ ou t) exists, such that 



for all a £ Si n . 



(25) 
I 



Another important case is when the output state space S out is abelian, 
namely, [p,cr] = 0, for all p,o~ £ S ut- This condition, in particular, implies 
that there exists an orthonormal basis {\i)}f = i for M out that diagonalizes all 
9 £ ©out- 

Proposition 3. Given two state spaces S; n and S out , defined on 5£i n and 
3^out; respectively, let S out be abelian. If there exists a linear map L : £(J£i n ) — > 
£(5t ou t) inducing a statistical morphism from Sj n to S ut ; then there exists a 
completely positive, trace-preserving map £ : i(J£j n ) — > £(5{ ou t) such that 



(26) 



/or a/Z p € S; 



Proof. For <i = dim J{ out , let {|«)}f =1 be the basis for !H out that simultaneously 
diagonalizes every a £ S ut, and denote by IT £ £(Jf out ) each projector \i)(i\. 
Then, for any a £ S out 



i=l 



Next, we note that, by defition of the trace-dual map L* 
Tr [IT = Tr[£*(IT) p] , 



(27) 



(28) 
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for all p € The fact that £ is a statistical morphism implies, by definition, 
that the operators (L* (IF))f =1 , even if not positive, yet induce a test on Sj n . 
In other words, there exists a POVM (IF)f_i such that 

Tr [£-(11*) p] = Tr [ftp] , (29) 

for all p £ Sj n and every i. 

Using the POVM (IP)f =1 (whose existence we proved above), we recall 
Eq. (|2~T|) above and consider the identity: 



£(p)=^Tr[ir£(p)] IT 



i=l 
d 



(30) 



The relation above holds for all p £ S; n C S(Mi n ). However, it is clear that 
the last term in Eq. (|30[) can be extended, by linearity, to a completely positive 
trace-preserving map £ : t(K in ) — > fi.(J{ ou t) defined as: 

d 

£(p) :=E Tr [n l p]n 4 , (31) 

1=1 

for all p S S(Mi n ). This hence concludes the proof that a completely positive 
trace-preserving map £ : H(!Kj n ) — > £(J{ ou t) exists, such that 

E(p)=L(p), (32) 

for all p G Si n . I 



6 A fundamental equivalence relation 

In this section, we prove our main result: 

Theorem 2. Given two quantum information structures q ab ~ {"Ka, Pab) 
and c, AB , = {"Ka^B' ,o-AB'), 

Qab Sab' ^ Qab d a c,ab>- (33) 

Moreover, the linear map inducing the statistical morphism between Q AB and 
c,ab' can always be chosen to be trace-preserving on the whole space SLCKb)- 

Remark 11. Shmaya, in Remark 7 of his Ref. [15] . asks the question whether 
Qab -'A ^ab 1 i s equivalent to the existence of a positive trace-preserving map T 
such that (Jab' = (}d®y)pAB- The above theorem shows that Shmaya's question 
is equivalent to asking whether any trace-preserving statistical morphism always 
admits a trace-preserving positive extension (about this point, see Remark [5] 
above). 

For the sake of clarity, we divide the proof of Theorem [5] in two parts. The 
first part is a lemma proved by Shmaya in Ref. [15] , as a direct consequence of 
the Separation Theorem for convex sets (see, e. g., Ref. [2"U]). 
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Before stating the lemma, we introduce the following notation: given a quan- 
tum information structure Q AB — {'Ka^b, pab), a decision set X, and a test 
9Jte = (M B ;i £ X) on the state space &b{q A b)i we define the following opera- 
tors: 

Pa\w ■= Trs [(1a ® M B ) pab\ , (34) 
for each i £ I, In Eq. (|34p . we can replace the family of operators 9Jis by 
any other family of operators which is statistically equivalent (in the sense of 
Remark to SDIb on ©^(g^Jfl- In particular, we can replace the operators 
M B by the elements P l B of any POVM P B = (P B ; i € X) on "K B realizing the 
test 93i s on Q b (q ab ). 

We are now ready to state the following: 

Lemma 1 (Shmaya [H]). Given two quantum information structures Q AB = 
(3{ A ,3< B ,p AB ) and c, AB , = (JC A ,3{. B ,,aAB>), if Q AB Da W> then, f° r an V 
finite decision set X and any test 3l B > — (N B ,; i £ X) on Qb'(c,ab')' there exists 
a test 9Kb = (^M B ;i G X^ on & b (q ab ) such that 

p\\m = ( 35 ) 

for all i £ 1. 

Proof. For the reader's convenience, we reformulate here Shmaya's proof ac- 
cording to our notation. For any finite decision set X, let us consider the set 
Ca(q ab , X) of all |X|-tuples 

(pA\w > P%w > ' ' ' ) Pa\m 

(36) 

where 9Jls varies over all possible X-tests on <Z b (q ab ). Clearly, Ga{q ab ,%) 
is a closed and bounded convex subset of the (real) linear space of |X|-tuples 
(T l ;i £ X) of self-adjoint matrices on 3~Ca, since it inherits its structure from 
the convex structure of the set of X-tests on (3 b (q ab ). 

The proof then proceeds by reductio ad absurdum. Suppose in fact that, for 
some decision set X, there exists a test Sfl B , = (N B , ; i S X) on <Z B i(c, AB ,) such 
that the corresponding |X|-tuple 

( CT A|si! . a A\-n > • • ' . a A\l\) $ g a(q ab , X). (37) 

Then, by the so-called Separation Theorem between convex sets (see, e. g., 
Ref. [20]) Corollary 11.4.2), there exists a |X|-tuple of self-adjoint operators 
(T A ; i £ X) on ^K A , such that 

max £ Tr [p A{m t] < £ Tr [a\ ]3i f<] , (38) 

where the maximization if taken over all tests S)l B — (M B ; i £ X) on <5 b (q ab ). 
This contradicts the assumption Q AB Da ^ab'- 1 

4 This fact can be proved by noticing that the joint probability distribution py x{j,i) : = 
Tr[(_F^ (g) Mg) pab], where (F.; j G y) is an informationally complete POVM on Ji A , equals, 
for all j 6 J and all i G X, that computed as Tr[(.Fj^ ® Xg) Pab], whenever (Xg\i £ 
3») ^S b (q ab ) (Mg',i 6 X). By the completeness of (F A ;j G V), we conclude that, in fact, 
Tr B [(l A ® Af<) p AS ] 
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Proof of Theorem^ One direction of the theorem, that is Q AB >- m q AB , => 
Q AB Da ^ab'i simply follows from the definition of m-sufficiency given in Defi- 
nition 1151 

Only the converse direction, i.e. Q AB Da c,ab' =^ Qab ^AB' j is hence 
non trivial. In order to construct a statistical morphism Lb ■ &b{Qab) 
Qb'(c,ab')i consider the decision set X = {1,2,- •• , (dimJfs/) 2 } and an in- 
formationally complete X-POVM (F B ,;i G X) on 'Kb 1 , with self-adjoint dual 
operators {9 B ,;i G X). The following identity holds 

7V=^Tr[I> F B ,]6 B „ (39) 

for all operators Tb 1 G 4(!Kb<). By linearity then 

Tab 1 = Tl 'B' [ t ab' {1a ® F B , )] ® 9 B , , (40) 

for all operators Tab' G ^i^A ® 9fs')- 

Let us now put, in Eq. (|4"0")) . Tab' = ctab'- Since we assume Da Sab'i 
by Lemma [TJ there exists a X-POVM {F B ; i 6 X) on JC^ such that 

Tr s [ P ab {1a ® J%)] = Tr B , [<t ab , (1 a <g> , (41) 

for all i € X. Fixed any such POVM {F B ;i G X), we define a linear map 
£ B : £(J{ B ) — > Jt(J£ B /) via the relation 

£ s (T B ):=^Tr[T flj P B ]^„ (42) 

for all operators Tb G 4(JCb). Equivalently, the linear map Lb can be defined 
by the relations L* B ,{F B ,) = F B , for all i G X. This guarantees £^,(1_b') — 1b, 
i.e. the linear map Lb is trace-preserving. Now, we have to check that the linear 
map Lb ■ S.{0-Cb) — > Q-ffiB') so constructed in fact satisfies both conditions in 
Definition 1131 and induces a statistical morphism from &b{Qab) ^° ^B'{c,ab')- 
We begin by noting that, as a consequence of Eqs. (|4*U)) . (jlTj) . and (|4*2")) . 
(icU (g> Lb){pab) = o~AB' ■ This can be shown as follows: 

0~AB' = Tl 'S' [o-AB' 0U ® ^B')] ® 

= H Trs ^ A ® **b)] ® °B' (43) 

= {\d A ®L b ){pab)- 

This also ensures that Lb{&b{q A b)) — &b'{$ab')- 

Let now X be an arbitrary (finite) decision set, and let 91b' '■= {N B i \ ! 6 1) 
be any X-test on &b'{c,ab')- We will now check, by applying Lemma[TJ that the 
operators X B := L*{N l B ,) indeed constitute a test on Qb{q A b)- The proof goes 
as follows: for every lob G ®-b({?ab)> ^ e ^OKa) be the positive operator 
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such that cub = Tr^ [(R A <8> Is) pab}- Consider now, for all the trace 

Tr[X* B cu B ] = Tr [{R» A ® X B )p AB ] 

= Tr[R A ' Ti B [(1 a ®X* b ) p AB ]] 

= Tr [R U A Tr s [(1a ® L* B {N B ,)) p AB ]] (44) 
= Tr[R% Tr B > [(1 A ®N B ,) (\d A ® L B )(p AB )]] 
= Tr [R% Tr s , [{t A ® N B ,) a AB >]] . 



Lemma [T] provides the existence of a POVM (P B ; i G X) on "K B such that 



Tr ? 



(1a ® Pfi) PAB = Tr^' [(1a ® A?fl/) CTAB'] 



(45) 



for all % e X. Plugging such POVM into Eq. (|3i|l. we obtain 

TrLY^ w B ] = Tr [i# Tr B , [(1a (8 <tab']] 



= Tr 
= Tr 
= Tr 



R\ Tr 

{Rf A ®P B ) PAB 

P % B ^B 



(1a (8 P' B ) pab 



(46) 



for all i 6 1. Since this holds for every ojb £ ®-b(&ab)> we P rove d that, for 
any finite X and any X-test (N B ,;i € X) on S>b'(^ab')j the operators X B := 
L* B {N B ,j indeed constitute a test on Q b (q ab ). This shows that £^ is a well- 
defined statistical morphism from <5 b (q ab ) to Sb'(^ab')i as requested. £ 



7 The Blackwell-Sherman-Stein theorem in the 
quantum case 

The BSS theorem (see Theorem[T]) is about the comparison of classical statistical 
models. According to Postulate [TJ however, we can actually identify the notion 
of classical statistical models with that of abelian quantum statistical models, 
so that the BSS theorem becomes a statement about comparison of abelian 
quantum statistical models. In this sense then, we call a "non-commutative (or 
quantum) BSS theorem" a statement characterizing equivalent conditions for 
the comparison of general quantum statistical models, recovering Theorem [Tj in 
the abelian case. In the following, we will show how Theorem [3] can be used to 
prove such a generalized statement. 

Definition 18. Given a quantum statistical model R = (9, IK, p), the associ- 
ated state space S(R) C S(J£) is defined as the set of states S(R) = {pe ■ e 
6}. 

Remark 12. As already noticed in Remark 1 101 it is irrelevant whether the max- 
imum in Eq. Q is taken over POVM's on % or over tests on S(R). For what 
follows, however, it is convenient to consider the expected payoff as maximized 
over tests, rather than POVM's. 
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As it happens for quantum information structures (see Definition 115)) , also 
for quantum statistical models we have two different notions of sufficiency: 

Definition 19 (Sufficiency and m-sufficiency) . The quantum statistical model 
R = (9, !K, p) is said to be m- sufficient for S = (9, !K', <x), in formula 

R y m S, (47) 

if and only if there exists a statistical morphism £ : S(R) — > S(S) such that 

ag=L{ Pe ), We 9. (48) 

The quantum statistical model R = (9,3"{,p) is said to be sufficient for S = 
(9, J£', <x), in formula 

R >- S, (49) 

if and only if there exists a completely positive, trace-preserving map £ : fi.(!K) — >• 
£(5C') such that 

a 9 = E(p e ), W e 8. (50) 

Theorem [51 via the correspondence exhibited in Eq. (JSJ), directly implies the 
following: 

Theorem 3 (Non-commutative BSS Theorem). Given two quantum statistical 
models R = (9, 5C, p) and S = (9, 5C', <x), 

R >- m S R d S. (51) 

Proof. Given the quantum statistical model R = (9, "K, p), let us construct the 
quantum information structure Q^ B = (JCa,!Kb, Pab), as done in Eq. ((§]). Let 
us repeat the same construction (using the same basis for 'Ka — 5te) to obtain 
q s 4B , = (%a> ^b>i o~ab) from S = (9, %',&). Keeping in mind Remark^ it is 
easy to verify that 

R ^ m S ^ q ab >- m c, s AB ,, (52) 

and that 

RDS q r ab Da4- (53) 
We then obtain the statement by direct application of Theorem [21 | 
Further, by applying Proposition [31 we obtain the following: 

Proposition 4 (Semi-classical case). Given two quantum statistical models R = 
(9,!K, p) and S = (9, JC',<r), if S is abelian, 

R >~ S ^ R D S. (54) 

Proof. By definition, S is an abelian quantum statistical model if and only if 
S(S) is an abelian state space. Then, due to Proposition [31 we know that, 
whenever S is an abelian quantum statistical model, R >- m S if and only if 
R >- S. With these remarks at hand, the statement is finally proved as a simple 
consequence of Theorem [3J above. | 
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Notice that Proposition 2] is still more general than the BSS theorem, since 
commutativity is required only for S, whereas the classical case is equivalent to 
the situation in which both R and S are abelian. Proposition|3]hence describes a 
"semi-classical" scenario. In the case in which also R is an abelian quantum sta- 
tistical model, it is easy to prove that any completely positive, trace-preserving 
map £ such that ag = £-(pg) can be in fact written as a transition matrix Me, 
mapping the vectors pg of eigenvalues of pg into the vectors qg of eigenvalues of 
ag, for all 9 £ 9, in complete accordance with the notion of sufficiency used in 
the BSS theorem [TJ We leave the proof of this to the reader. 

Next, we show that Theorem [21 together with Proposition [21 provides an 
equivalent characterization of the sufficiency relation >- for quantum statistical 
models. We first need the following definitions: 

Definition 20 (Composition of Quantum Statistical Models). Given any two 
quantum statistical models R = (9, IK, p), with p = (pg;6 £ 9), and T = 
(3, DC, r), with r = (rg; £ £ 3), the composition T x R is defined as the quantum 
statistical model (3 x 9, % ® "K, r x p), where r x p := (tj ® pg; £ 6 3, 9 e 9). 
Moreover, S(T xR) = S(T) x S(R). 

Definition 21 (Complete Quantum Statistical Models). A quantum statistical 
model T = (3,3C,r) is said to be complete if and only if S(T) is a complete 
state space. 

Proposition 5 (Equivalent condition for sufficiency). Given two quantum sta- 
tistical models R = (9,!H, p) and S = (9, 3i',cr), the following are equivalent: 

1. 

R y S; (55) 

2. 

T x R d T x S, (56) 
for every auxiliary quantum statistical model T = (3, 3C,t); 

3. 

T x R d T x S, (57) 
for some complete quantum statistical model T = (3, 3C, r) with % = !H'. 

Proof. The implications "1 2" and "2 =^ 3" are trivial. In order to prove 
the implication "3 1", let us consider an auxiliary quantum statistical model 
T = (5, Jf', t), such that S(T) is complete for £(?{'), according to DefinitionfTBl 
The condition TxRdTxS implies, by Theorem^ the existence of a statistical 
morphism L : S(T x R) — > S(T x S) such that L(t^ (g) pg) = ® ag, for all 
£ 6 3 and all 9 e 9. By the completeness of G(T), this implies that the linear 
map L : (g) £.(S£) C(JC') (g> £(?£') must in fact have the form id <g> L' . 

We are hence in the position to apply Proposition [21 which proves the existence 
of a completely positive, trace-preserving map £ : £.(?{) — > fi.(!K') such that 
ag = E(pg), for all 9 G 9, i.e. R y S. | 

The corollary above makes it apparent that complete positivity is always 
related with the possibility of extending a quantum system (in this 
quantum statistical model) by composing it with an auxiliary one. 
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8 Sufficiency of quantum information structures, 
without entanglement 

We begin this section with the following definition: 

Definition 22 (Composition of Quantum Information Structures). Given two 
quantum information structures Q Ag = {"Ma, ^-b,Pab) and ul>xy — {Jix> ^y,^xy 
the composition q ab ®tuxr is defined as the triple ('Ka ® ^x , ^b ® My , Pab ® 

Remark 13. From Definitions [Til [T71 andl2"2l it simply follows that 

^>by{Qab ® w xy) 2 ^b(Qab) x ^y(wxy)- (58) 

Definition 23 (Complete Information Structures). A quantum information 
structure Wxy = ffix, , &x y) is complete if and only if: 

1. the local state space Qy(wxy) is complete (see Definition ITS)) . and, 

2. for any given linear map Ly '■ &(JCy) — > £(JCy), (idx = ujxy 
if and only if £y = idy. 

Remark 14. In order to explicitly show the existence of a complete information 
structure cu^y" = My, wxy), let us consider the family of information 

structures iv XY = (Mx, JCy,w^y), forp G [0, 1], where dimlKjf = dimJ£y = d 
and w^-y is an isotropic state, that is, 

^XY'-=P*1cY + (l-P)^jjir> (59) 

with ^J^y denoting a maximally entangled state in J£x <g>3"Cy. These states are 
known to satisfy the second condition in Definition [23] for p + [23]. Moreover, 
a simple calculation shows that 



Gy G S(J{y) I , (60) 



{1 
pay + (1 -j>) — 

meaning that, for p + 0, Sy(cu X y) is complete. 
We are now able to state the following: 

Proposition 6 (Comparison of quantum information structures). Given two 
quantum information structures Q AB = (JKat^b, Pab) andc, AB , = (JC^, Mb', o~ab' 
the following are equivalent: 

1. 

Qab^^ab'', (61) 

2. 

[w X y ® Qab] d xa [w X y ® c, A b>] > (62) 
for every auxiliary quantum information structure Wxy = (Jfxi My, Wjy); 

5. 

® ^s] ^xa [A>xy ® W] . (63) 
/or some auxiliary quantum information structure t\> X y = (JCjc, My, ^J^y), 
smc/i i/ia£ ^Jfy is a maximally entangled pure state and < J~Cx — — ^Cb* ?' 
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[<VXY ® Q AB ] ^XA [W X Y ® C,AB>] > ( 64 ) 

/or some auxiliary complete quantum information structure Wxy — fflx, ^y, ^>xy) 
with Jfy — 3~Cb' ■ 

Proof. The implications "1 => 2" and "2 => 3" are trivial. The implication 
"3 =>• 4" follows from the fact that, from Eq. any maximally entangled 

information structure is, in particular, complete. We hence prove only the 
implications "4 =>■ 1". 

Starting from (f6"4")l . Theorem [2] guarantees the existence of a statistical mor- 
phism L Y b ■ >5yb(u>xy ® AB ) Sy B /(cu X y ® Qab') sucn t^at 

ojxy ® cab' = (idxA ® £ys)Ky ® pas)- (65) 

Since wjy is a complete state, Eq. (|65l) implies that the linear map Lyb must 
in fact have the form 

L Y b = idy ® £>b- (66) 

Further, the fact that idy ®Lb is a statistical morphism from *5yb(u>xy®Q A b) 
to toys' (cUxy ® Qab') implies that idy <g> £s is also a statistical morphism, in 
particular, from Sy(cuxy) x &b{q ab ) to Sy(cuxy) x ®b'(Qab')i because of 
Eq. (|58p . Finally, since we assumed that Sy(o)xy) is a complete state space, 
we can apply Proposition [2] to show that, indeed, Q AB >- c, AB ,. | 

Remark 15. In Ref. [T5], the statement "3 <^ 1" is proved. Proposition [5] shows 
that the hypotheses can in fact be relaxed so that only the property of complete- 
ness, rather than entanglement, is required. Let us consider, as an example, the 
set of isotropic states defined in (|55)l . Such states are known to be separable for 

p < g^j- Hence, by fixing a value p„ £ ^0, -j^j > we have that uj^ y ^ s complete, 
induces a complete state space on Y, and, yet, it is a separable state. This fact 
recalls the results of Ref. [23] , where it was first noted how completeness (there 
referred to as "faithfulness") can replace entanglement, although in a different 
contest (namely, quantum process tomography). 

Remark 16. In Remark U we described how quantum statistical models can be 
identified with those quantum information structures, for which a decomposition 
like that in Eq. ([5]) exists. One should hence expect that Proposition [5] implies 
Proposition whenever q ab — ( i K Ai 'Kb 1 Pab) and c, AB , — (JCa,?Cb' ,o~ab') 
can be written in the form of Eq. ([5]). In such a case, indeed, the fourth statement 
of Proposition[S]can be used to re-derive Proposition^simply by considering an 
auxiliary quantum information structure iVxY = (JC^, My,wxy) of the form 



U)XY 

The crucial observation is that the above quantum information structure is 
complete if and only if the corresponding quantum statistical model To, := 
(3,5£y,r), with r = (r^;^ £ 5), is complete. The rest of the proof is left to 
the interested reader. 
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9 Conclusions 



We extended some results from the theory of comparison of statistical models 
to quantum statistical decision theory. This has been done by relaxing Petz's 
definition of coarse-grainings to that of statistical morphisms. By using such 
generalized notion, we introduced comparison criteria for quantum statistical 
models and quantum information structures, which are the direct generaliza- 
tion to a non-commutative setting of the comparison criteria used in classical 
decision theory. The framework we described turned out to be general enough 
to encompass both the classical and the quantum case. We showed how results 
that previously were independent, like the Blackwell-Shcrman-Stein theorem 
for statistical models and Shmaya's result for quantum information structures, 
can be in fact recovered as special cases of a single, unifying comparison theo- 
rem, which also sheds new light on both: the BSS theorem has been extended 
to a quantum-classical scenario, and Shmaya's comparison criterion has been 
strengthened by removing the need of auxiliary entangled resources. 

As a final remark, the reader might have noticed that, as long as the states of 
a statistical theory can be represented by self-adjoint matrices (not necessarily 
positive) of unit trace, the definitions of information ordering and m-sufficiency 
proposed here can be straightforwardly extended to consider such cases as well. 
For such generalized probabilistic theories, an extension of the BSS theorem can 
also be proved, along the same lines described in the present work. 
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A Proof of Proposition CD 



Proposition 1. For any two given statistical models S — (0, A, a) and & = 

(6, A', P),*D & if and only if S D B&ycs 

Proof. The statement can be proved by using the Separation Theorem between 
convex sets [2U] as follows. (Notice that in our case all convex sets are closed 
and bounded, so that we can proceed without paying attention to too many 
technical details.) 

Generally speaking, the convex set Ci C R N is not contained in the convex 
set C2 C R if and only if there exists a point v £ Ci such that v £ 62- Then, 
the Separation Theorem (Corollary 11.4.2 of Ref. [2UJ), applied to the convex 
set C2 and the single-point (hence convex) set {v}, states that, for such v, there 
exists a vector b £ H N such that 

N N 



^b n w n <^b n v n . (68) 



max 

wee 2 

n—l n—l 



Equivalently, we can say that the convex set Ci C R w is contained in the convex 
set C 2 C if and only if, for all vectors b G R^, 



N N 



J2 b n w n > max ^ b n v n . (69) 



max 

n—l n—l 

Moreover, for any given non- vanishing probability distribution ir(n), ^ n w(n) 
1, the convex set Ci C R w is contained in the convex set C2 C R^ if and only 
if, for all vectors b € R*, 

N N 

max V Tr(n)b n w n > max V n(n)b n v n . (70) 

n=l n—l 

This follows from the fact that the above equation has to hold for all b £ R , 
so that the non- vanishing probabilities 7r(n) can be absorbed in the definition 
of b. In particular, there is no loss of generality in considering ir(n) = l/N, for 
all n. 

We now turn to the case of C(<?, X, 1) and C(J^", X, £), choosing the a priori 
probability on as tt (6) = for all 6. Then, for every b G Rl e l, 

, |^| E ^ *> * Q = „ |^| E ^ *> *), 

1 1 eee 1 1 eee 

where the function £ at the left hand side is another payoff function with such 
that £(8,i) — £(9,i)b 6 . In other words, the vector b can be absorbed in the 
definition of the payoff function. This means that, for any finite set of decisions 
X and any payoff function £ : x X -> R, X, i) D e(=T, X, I) if and only if, 
for every payoff function < : 8 x 1 -> R, 

M E # ' x ' J) > W\ E u V; x > I] > (72) 

1 1 eee 1 1 eee 

where the maxima are taken over all possible X-r.d.f. on A and <j)' on A'. 
This, in turns, implies the statement. | 
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